Scaling up all pairs similarity search pdf

ثبت نشده
چکیده

Given a large collection of sparse vector data in a high dimensional space, we investigate the problem of finding all pairs of vectors whose similarity.ABSTRACT. Given a large collection of sparse vector data in a high dimensional space, we investigate the problem of finding all pairs of vectors whose similarity. Scaling up all pairs similarity search, Published by ACM. The problem of finding all pairs of vectors whose similarity score as determined by a function such as cosine distance is above a given threshold.This problem is also known as the similarity join. Scaling Up All-Pairs Similarity Search. Download from: http:www.bayardo.orgpswww2007.pdf. All pairs similarity search is used in many web search and.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Scaling Out All Pairs Similarity Search with MapReduce

Given a collection of objects, the All Pairs Similarity Search problem involves discovering all those pairs of objects whose similarity is above a certain threshold. In this paper we focus on document collections which are characterized by a sparseness that allows effective pruning strategies. Our contribution is a new parallel algorithm within the MapReduce framework. The proposed algorithm is...

متن کامل

Scaling up top-K cosine similarity search

Article history: Received 21 September 2009 Received in revised form 23 August 2010 Accepted 23 August 2010 Available online 8 September 2010 Recent years have witnessed an increased interest in computing cosine similarity in many application domains. Most previous studies require the specification of a minimum similarity threshold to perform the cosine similarity computation. However, it is us...

متن کامل

Scaling of solar wind epsilon and the AU , AL and AE indices

Scaling of solar wind epsilon and the AU, AL and AE indices. Abstract. We apply the finite size scaling technique to quantify the statistical properties of fluctuations in AU, AL and AE indices and in the ǫ parameter that represents energy input from the solar wind into the magnetosphere. We find that the exponents needed to rescale the probability density functions (PDF) of the fluctuations ar...

متن کامل

Scaling of solar wind ǫ and the AU , AL and AE indices as seen by WIND

Scaling of solar wind ǫ and the AU, AL and AE indices as seen by WIND. Abstract. We apply the finite size scaling technique to quantify the statistical properties of fluctuations in AU, AL and AE indices and in the ǫ parameter that represents energy input from the solar wind into the magnetosphere. We find that the exponents needed to rescale the probability density functions (PDF) of the fluct...

متن کامل

MM-MDS: A Multidimensional Scaling Database with Similarity Ratings for 240 Object Categories from the Massive Memory Picture Database

Cognitive theories in visual attention and perception, categorization, and memory often critically rely on concepts of similarity among objects, and empirically require measures of "sameness" among their stimuli. For instance, a researcher may require similarity estimates among multiple exemplars of a target category in visual search, or targets and lures in recognition memory. Quantifying simi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015